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The microbial conversion of solid cellulosic biomass to liquid biofuels may provide a 
renewable energy source for transportation fuels. Cellulolytic fungi represent a promising 
group of organisms, as they have evolved complex systems for adaptation to their 
natural habitat. The filamentous fungus Myceliophthora thermophila constitutes an 
exceptionally powerful cellulolytic microorganism that synthesizes a complete set of 
enzymes necessary for the breakdown of plant cell wall. The genome of this fungus 
has been recently sequenced and annotated, allowing systematic examination and 
identification of enzymes required for the degradation of lignocellulosic biomass. The 
genomic analysis revealed the existence of an expanded enzymatic repertoire including 
numerous cellulases, hemicellulases, and enzymes with auxiliary activities, covering 
the most of the recognized CAZy families. Most of them were predicted to possess 
a secretion signal and undergo through post-translational glycosylation modifications. 
These data offer a better understanding of activities embedded in fungal lignocellulose 
decomposition mechanisms and suggest that M. thermophila could be made usable as an 
industrial production host for cellulolytic and hemicellulolytic enzymes. 
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INTRODUCTION 

Ethanol production from lignocellulosic biomass, comprised pri- 
marily of cellulose and hemicellulose, appears to evolve as one 
of the most important technologies for sustainable development. 
Given its renewable nature, biomass is a potential raw mate- 
rial not only for the production of biofuels, but also chemicals, 
energy and other materials of main industrial interest (Zhang 
et al., 2006). The monosaccharides contained in the cellulosic 
(glucose) and hemicellulosic fractions (xylose, arabinose, man- 
nose, and galactose) represent substrates that can be used for 
ethanol production via fermentation. To initiate the degradation 
of these fractions, it is necessary to overcome the physical and 
chemical barriers presented by the cohesive combination of the 
main biomass components, which hinders the hydrolysis of cellu- 
lose and hemicellulose into fermentable sugars. The above include 
high substrate viscosity, poor mass transfer conditions and long 
reaction times, during which hydrolysis reactors are susceptible 
to contamination. Fungi are the main decomposers of lignocel- 
lulosic biomass in terrestrial ecosystems and the enzymes they 
secrete to break down lignocellulose may be useful in industrial 
processes. Thermophilic fungi provide a potential source of plant 
cell wall degrading enzymes with higher levels of specific activity 
and better stability at higher temperatures, thus making it feasible 
to minimize the hydrolysis time, reduce substrate viscosity and 
contamination levels (Margaritis and Merchant, 1986). 

Myceliophthora thermophila (synonym Sporotrichum ther- 
mophile) is a thermophilic filamentous fungus, classified as an 



ascomycete, which was isolated from soil in eastern Russia 
and constitutes an exceptionally powerful cellulolytic organism, 
which synthesizes a complete set of enzymes necessary for the 
breakdown of cellulose. The growth rate and cell density of 
this microorganism appear to be similar in media containing 
cellulose or glucose (Bhat and Maheshwari, 1987). The 38.7 
Mbp genome of M. thermophila, comprising about 9500 genes, 
organized in 7 chromosomes, has been sequenced and anno- 
tated (Joint Genome Institute, University of California, http:// 
genome.jgi-psf.org; Berka et al, 2011). It revealed a large num- 
ber of genes putatively encoding industrially important enzymes, 
such as carbohydrate-active enzymes (CAZy), proteases, oxido- 
reductases, and lipases, while more than 200 sequences have 
been identified exclusively for plant cell-wall-degrading enzymes. 
These sequences encode a large number of glycoside hydrolases 
(GH) and polysaccharide lyases, covering the most of the recog- 
nized families (Table 1). In addition, M. thermophila was devel- 
oped into a proprietary mature enzyme production system with 
easy scaling (CI strain; Visser et al., 2011). The main features of 
CI are the high production levels (up to 100 g/L protein), as well 
as the maintenance of low viscosity levels of the culture medium, 
thus enabling fermentation process to reach very high densities. 

M. thermophila exhibits an impressing number of accessory 
enzymes belonging to AA9 (previously described as GH61) and 
family 1 carbohydrate binding modules (CBM), which are the 
highest found in fungi (Berka et al., 201 1). Family 1 CBM presents 
a cellulose-binding function and is almost exclusively found in 



www.frontiersin.org 



June 2014 | Volume 5 | Article 281 | 1 



Karnaouri et al. 



The lignocellulolytic system of Myceliophthora thermophila 



Table 1 | Number of predicted CAZymes encoded in the genome of 
M. thermophila. 



Specific activity 


CAZy module(s) 


No. id. seq. 


Cellulases 


Endoglucanases 


GH 5, 7, 12, 45 


8 




Cellobiohydrolases 


GH 6, 7 


7 




f3-glucosidases 


GH 1, 3 


8 




LPMOs 


AA9 


25 


Xylanases 


Xylanases 


GH 10, 11 


12 




Xylosidases 


GH 3, 43 


4 


Arabinases 


Endoarabinases 


GH 43 


3 




Exo-arabinases/ 


GH 43, 51, 62 


11 




arabinofuranosidases 






Mannanases 


Endomannanases 


GH 5, 26 


3 




Mannosidases 


GH 2 


2 


Pectinases 


Polygalacturonases 


GH28 


2 




Rhamnosidases 


GH78 


1 




Pectin lyases 


PL 1, 3, 4, 20 


8 




Pectin esterases 


CE 8, 12 


4 


Esterases 


Feruloyl esterases 


CE 1 


4 




Acetyl esterases 


CE 3, 5, 16 


8 




Acetylmannan 


CE 12 


2 




esterases 








Glycuronoyl 


CE 15 


2 




esterases 







GHs, Glycoside hydrolases; CEs, carbohydrate esterases; and PLs, 
polysaccharide lyases are included, covering the most of the recognized 
families. 



enzymes of fungal origin (http://www.cazy.org; Guillen et al., 
2009). In addition, M. thermophila distinguishes itself from other 
cellulolytic fungi, such as Aspergillus niger and Trichoderma ree- 
sei by the presence of a relatively high number of (glucurono) 
arabinoxylan degrading enzymes (Hinz et al, 2009). Eleven 
putative xylanases were found that belong into GH 10 and 11 
families compared to five in both A. niger (Broad Institute of 
Harvard and MIT, http://www.broadinstitute.org) and T. reesei 
(Joint Genome Institute, University of California, http://genome. 
jgi-psf.org), while 14 arabinofuranosidases belonging to GH 43, 
51, and 62 families were found compared to 13 in A. niger and 
three in T. reesei, rendering M. thermophila a promising source 
of hemicellulolytic enzymes. Studying the secretome of M. ther- 
mophila after 30 h of growth in barley and alfalfa straws, it was 
found to comprise of 683 predicted proteins, 230 of which are 
proteins with unknown function (Berka et al., 2011). Based on 
transcriptome analysis, many secreted enzymes including acces- 
sory enzymes, hypothetical proteins and proteins with unknown 
function were upregulated, when the fungus is grown in more 
complex substrates, such as agricultural straws, compared to glu- 
cose, indicating their crucial role in lignocellulose degradation 
(Berka et al, 2011). 

M. thermophila grows in temperatures between 25 and 
55° C, while a relative growth performance study on myco- 
broth agar plates indicated that the optimum condition is at 
45°C (Morgenstern et al., 2012). The temperature optima for 



several enzymes with the same specific activity, characterized 
from M. thermophila, range from 50 to 70° C. For example, 
StEG5 endoglucanase, expressed in A. niger, exhibits a T opt of 
70° C (Tambor et al, 2012), while recombinant MtEG7 expressed 
in Pichia pastoris exhibited an optimal temperature of 60° C 
(Karnaouri et al., 2014). The same characteristic is also observed 
for M. thermophila xylanases expressed in A. niger, showing opti- 
mal activity at temperatures between 50 and 70°C (Berka et al., 
2011), underpinning the enzymatic potential that is not only 
diverse in catalytic activities, but also in properties increasing its 
efficiency in various temperatures. 

Individual cellulolytic enzymes exhibit comparable activities 
on cellulose; however, synthetically composed multienzyme mix- 
tures display a much higher performance than those from other 
lignocellulolytic thermostable fungi (Szijarto et al., 2011; Zhang 
et al., 2013). This can be attributed to synergistic mode of action 
between the enzymes. For example, synergism between GH 1 1 
xylanase and type C feruloyl esterase has been proved (Moukouli 
et al., 2010), as well as between cellobiohydrolases acting on 
the reducing and the non-reducing end of cellulose molecules 
(Gusakov et al, 2007). 

In this review, an overview will be given of the cellulolytic 
and hemicellulolytic potential of M. thermophila regarding the 
degradation of plant cell wall material. The genomic potential 
of this thermophilic fungus demonstrates a strong enzymatic 
toolbox including hydrolytic, oxidative and accessory activities 
that may enhance its ability to decompose plant biomass. Many 
of these enzymes have been isolated from culture supernatant 
or selectively overexpressed in M. thermophila (CI strain) or 
in other heterologous hosts and have been characterized. All 
sequences used in this study were extracted from Genome Portal 
database (http://genome.jgi-psf.org) and the continually updated 
CAZy database (http://www.cazy.org/; Lombard et al., 2014). The 
conserved domains were found with Pfam/InterProscan (http:// 
pfam.sanger.ac.uk/; Punta et al., 2012), while the theoretical 
molecular mass and isoelectric point for each protein were cal- 
culated using the ProtParam tool of ExPASY (http://web.expasy. 
org/protparam/). Post-translational glycosylation sites were pre- 
dicted with NetNGlyc 1.0 server (http://www.cbs.dtu.dk/services/ 
NetNGlyc/) and NetOGlyc 3.1 server (http://www.cbs.dtu.dk/ 
services/NetOGlyc/). Predicted secretome was extracted using 
SignalP v4.0 (http://www.cbs.dtu.dk/services/SignalP/). 

CELLULOLYTIC SYSTEM 

Cellulose is composed of p 1 -D-anhydroglucopyranose units linked 
by (l,4)-glycosidic bonds. Polymorphism or allotropy refers to 
the existence of more than one crystalline forms differing in phys- 
ical and chemical properties. Cellulose degradation is attributed 
to the synergistic action of three complementary enzyme activ- 
ities: (1) endoglucanases (EGs, EC 3.2.1.4); (2) exoglucanases, 
including cellodextrinases (EC 3.2.1.74) and cellobiohydrolases 
(CBHs, EC 3.2.1.91 for the non-reducing end acting CBHs and 
EC 3.2.1.176 for the reducing end acting ones) and (3) f5- 
glucosidases (BGs, EC 3.2.1.21) (Lynd et al., 2002). Amorphous 
regions of the polysaccharide chain are cleaved randomly by EGs, 
while CBHs remove processively cellooligosaccharides from chain 
ends. The latter are the most abundant enzymes in the secretome 
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of cellulolytic fungi (Jun et al., 2011; Ribeiro et al, 2012). Their 
main representatives are GH family 7 (CBH I) that attack the 
reducing end of a cellulose chain and GH family 6 (CBH II) 
that are specific toward the non-reducing end of the chain. Until 
very recently, CBHs were considered as the main degraders of the 
crystalline part of cellulose (Sweeney and Xu, 2012). 

EGs are widespread among GH families, with examples 
described for families 5-9, 12, 44, 45, 48, 51, and 74 on the con- 
tinually updated CAZy database (http://www.cazy.org/; Lombard 
et al., 2014). Most of them show optimal activity at neutral or 
acidic pH and at temperatures below 50°C (Maheshwari et al., 
2000). Exo-glucanases (or CBHs) act in a processive manner 
(Davies and Henrissat, 1995) and are classified only to two fam- 
ilies, as referred previously. One of the important features of all 
CBHs is that they can act on micro crystalline cellulose (Terri, 
1997). BGs include enzymes of GH1 and GH3 families that 
hydrolyze cellobiose and short (soluble) cellooligosaccharides to 
glucose that could subsequently fermented to ethanol; e.g., the 
hydrolysis reaction is performed in the liquid phase, rather than 
on the surface of the insoluble cellulose particles, such as EGs and 
CBHs. The removal of cellobiose is an important step of the enzy- 
matic hydrolysis process, as it assists in reduction of the inhibitory 
effect of cellobiose on EG and CBH. BG activity has often been 
found to be rate-limiting during enzymatic hydrolysis of cellu- 
lose (Duff and Murray, 1996; Tolan and Foody, 1999), and due 
to that the commercial cellulase enzyme preparations are often 
supplemented with BG activity. 

Until recently, only hydrolytic enzymes were thought to play a 
role in the degradation of recalcitrant cellulose and hemicelluloses 
to fermentable sugars. Recent studies demonstrate that enzymes 
from the GH family 61 show lytic polysaccharide monooxyge- 
nase activity (LPMO) and have an enhancing cellulolytic effect 
when combined with common cellulases (Horn et al., 2012). 




FIGURE 1 | Distribution of cellulolytic enzymes of M. 
thermophila throughout eight GH families. Other activities refer 
to fi-xylosidase (GH3), p-1 ,6-galactanase, p-1 ,3-glucanase, 



Together with cellobiose dehydrogenase (CDH; EC 1.1.99.18), 
an enzymatic system capable of oxidative cellulose cleavage is 
formed, which increases the efficiency of cellulases and boosts the 
enzymatic conversion of lignocellulose. It has long been thought 
that the proteins of GH family 61 are accessory proteins enhanc- 
ing cellulose decomposition. They were thus frequently referred 
to as the "cellulose enhancing factors" (Harris et al., 2010) and 
previously thought to have no or only weak endoglucanase activ- 
ity (Karlsson et al., 2001). Now, these enzymes are now reclassified 
to AA9 family of CAZy database and their mode of action provide 
a new dimension to the classical concept of cellulose degradation, 
as recently reviewed by Dimarogona et al. (2013). These copper- 
dependent enzymes were shown to cleave cellulose by an oxidative 
mechanism provided that reduction equivalents from CDH or 
low molecular weight reducing agents (e.g., ascorbate) are avail- 
able (Langston et al, 2011). In some genomes, AA9 genes even 
outnumber cellulose genes. It remains to be elucidated whether 
all of these encoded enzymes have PMO activity, but their large 
number emphasizes the importance of oxidative cellulose cleav- 
age. M. thermophila's genome has 25 AA9 genes, encoding puta- 
tive proteins acting as accessory LPMOs enzymes (Berka et al., 
2011). This number is outstanding in comparison to common 
lignocellulolytic organisms, as A. niger (seven sequences) and T. 
reesei (nine sequences). This difference can explain the high effi- 
ciency of hydrolysis of Myceliopthora in nature substrates and 
reveals the crucial role of these enzymes in the whole procedure. 

Throughout the genome of M. thermophila, there are eight 
sequences encoding EG activity, seven sequences of CBH activ- 
ity and nine sequences of BG activity (Figure 1; Table 2). The 
theoretical average molecular weight of the translated proteins is 
calculated at 51.05 ± 16.2 kDa and the theoretical pi at 5.58 ± 0.3. 
EGs are distributed to families GH5, 7, 12, and 45, all predicted 
to possess a secretion signal and several N- and O-glycosylation 



0 other activity 
dp.glucosidasc 

■ ceUobiohydrolase / fM,4-exoglucanase 
D (i- 1 ,4-endoglucanase 
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endo-1 ,4-beta-mannosidase or putative proteins with unknown 
function (GH5). GH74 represents xyloglucan specific 
1 ,4-endoglucanase/xyloglucanase. 
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sites. Only two of them exhibit a CBM that belong in family 1. 
CBHs represent three non-reducing acting enzymes of GH6 fam- 
ily and four reducing-end acting enzymes of GH7 family. All of 
these enzymes seem to be targeted to secretion pathway and mod- 
ified with glycans during post-translational modifications. BGs 
are classified to GH3 family, except one GH1 sequence, while 
none of them exhibit a CBM, as expected. Four are secreted and 
have potential N- and O-glycosylation sites, showing the high- 
est molecular weight compared to the other cellulases, with a 
theoretical average value of 85.18 ± 3.2 kDa. 

Totally, 12 cellulases have been isolated and characterized 
(Table 3). The group of Bukhtojarov et al. (2004) investigated the 
properties of individual cellulases from the multienzyme com- 
plex produced by a mutant strain of M. thermophila CI (Visser 
et al., 2011). Among EGs, the highest saccharification activity was 
displayed by EG60 and EG51, representing enzymes of 60 and 
51 kDa, respectively, which exhibited pi values of 3.6 and 5.0, 



respectively. It has been shown later that the EG51 and EG60 
represent the GH5 and GH7 EGs from M. thermophila, respec- 
tively, (Gusakov et al, 2011). A different EG (SfCel5A) displays 
a typical GH5 domain, exhibiting optimal activity at pH 6.0 and 
70°C and retained greater than 50% of its activity following 2h 
of incubation at 55° C, diluted in 10 mM citrate buffer pH 4.5 
(Tambor et al, 2012). A GH7 EG gene was functionally expressed 
in methylotrophic yeast P. pastoris and subsequently characterized 
(Karnaouri et al., 2014). Substrate specificity analysis revealed 
that the enzyme is one of the most thermostable fungal enzymes 
reported up to now and exhibits high activity on substrates 
containing P-l,4-glycosidic bonds as well as activity on xylan- 
containing substrates. Moreover, MfEG7a was proved to liquefy 
rapidly and efficiently pretreated wheat straw, indicating EGs' 
key role to the initial step of hydrolysis of high-solids lignocel- 
lulose substrates (Karnaouri et al., 2014). This change in viscosity 
of these substrates is probably due to the gradual reduction of 



Table 3 | Description of the characterized cellulolytic enzymes either isolated from the culture broth of a M. thermophila C1 mutant strain or 
expressed in a heterologous host. 



Enzyme Type of action 



GH MW-monomer pH 

opt T 0 pt (°C) pi 

(kDa) 



Gene Source 



References 



StCel5A Endoglucanase 5 

EG51 Endoglucanase 5 

MtEG7a Endoglucanase 7 

EG60 Endoglucanase 7 

EG28 Endoglucanase 12 

EG25 Endoglucanase 45 

CBH la 1,4-beta 7 
cellobiohydrolase 

CBH I la 1,4-beta 6 
cellobiohydrolase 

CBH lb 1,4-beta 7 
cellobiohydrolase 

CBH lib 1,4-beta 6 
cellobiohydrolase 

MtBgl3a beta-glucosidase 3 

Bxl5 beta-glucosidase 3 



46 

51 

65 
60 

28 

25 

65 

43 

60 

70 



6 70 

4.7 70 

5 60 

4.7 60 

5.35 60 

5.5 65 

5 ND 



5.4 



ND 



ND 



65 



ND 



ND 



90 5 70 

120 ±5 4.6 75 



ND 



4.8 



Multiple bands 
in 3.8-4.5 
3.7 



5.7 



4.5 



4.2 



ND 



5.6 



4.0 



5.2 



cDNA library-EST 
analysis, expessed in 
A. niger 

Isolated from the 
culture broth of a C1 
mutant strain 
eg7a Expressed in P. 
pastoris 

Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 
Isolated from the 
culture broth of a C1 
mutant strain 

bgl3a Expressed in P. 
pastoris 

bx!5 Homologously 
expressed in C1 



Tambor et al., 
2012 

Bukhtojarov et al., 
2004; Gusakov 
etal., 2005 
Karnaouri et al., 
2014 

Bukhtojarov 
etal., 2004 

Bukhtojarov 
etal., 2004 

Bukhtojarov et al., 
2004; Gusakov 
etal., 2005 
Bukhtojarov et al., 
2004; Gusakov 
et al., 2005 
Bukhtojarov et al., 
2004; Gusakov 
etal., 2005 
Gusakov et al., 
2007 

Gusakov et al., 
2007 

Karnaouri et al., 
2013 

Dotsenko et al., 
2012 



Most of them exhibit optimum temperatures above 60° C and pH around 5.0. 
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the average chain length of cellulose polysaccharides by endo- 
acting enzymes, such as endoglucanases. Totally, four CBHs and 
two BGs have been isolated from M. thermophila crude super- 
natant and studied. CBH lib is the product of MYCTH_66729 
gene that represents an enzyme of GH6 family, which is attached 
to polysaccharide substrate through a CBM and exhibits high 
levels of activity in comparison to other CBHs (Gusakov et al., 
2007). In the same study, the isolation of CBH lb, a GH7 fam- 
ily enzyme (MYCTH_2140736) is reported, which acts mainly 
against microcrystalline cellulose and CMC. Bukhtojarov et al. 
(2004) studied the properties of CBH la and CBH Ha, which 
are classified to GH7 and GH6 family, respectively. CBH la is 
the product of MYCTH_109566 gene, and seems to be expressed 
in two isoforms with distinct molecular weights, one exhibit- 
ing the catalytic domain owing a CMB and the other only the 
catalytic domain and part of the linker, after proteolysis. This 
enzyme is produced as a major protein of fungi's secretome (20- 
25% of the total extracellular protein) and adsorbed strongly on 
microcrystalline cellulose. It has been shown that there is a sig- 
nificant synergism between CBH lib and CBH la enzymes during 
substrate hydrolysis (Gusakov et al., 2007). 

Recently, another type of specific activity was revealed. 
Xyloglucan specific exo-f5-l,4-glucanase (Xgl74A; EC 3.2.1.155) 
is classified to GH74 family and catalyzes the hydrolysis of 
(l-4)-D-glucosidic linkages in xyloglucans aiming in the success- 
ful removal of oligosaccharides from the chain end (Grishutin 
et al., 2004). Xyloglucan is a major structural polysaccharide 
found in the primary cell walls of higher plants that inter- 
act with cellulose microfibrils via hydrogen bonds to form a 
structural network that is assumed to play a key role in cell 
wall integrity. It consists of a cellulose-like backbone of P-1,4- 
linked D-glucopyranose (D-Glcp) residues, which most of them 
are substituted at C-6 with a-d-Xylp-(l— >6) residues, to which 
other saccharides may be attached (most frequently, d-Galp and 



1-Fucp). M. thermophila was found to produce an exo-P-1,4- 
glucanase (Xgl74A) (Grishutin et al, 2004) with high specific 
activity toward tamarind xyloglucan, and very low or absent 
activity against carboxymethylcellulose (CMC) and barley (3- 
glucan. Due to its unique substrate specificity the enzyme was 
given a new number in the Enzyme Nomenclature (EC 3.2.1.155). 
Apart from Xgl74A, two out of the seven cellulases reported from 
M. thermophila (Cell2A and Cel45A) possess a notable activity 
against xyloglucan, together with their major activities toward 
CMC and barley fi-glucan (Bukhtojarov et al, 2004). 

HEMICELLULOLYTIC SYSTEM 

Hemicellulose polymers have a much more diverse structure than 
cellulose and consequently several enzymes are needed to com- 
pletely degrade the polysaccharide into monosaccharides. Xylan 
that is the major component of hemicellulose in the plant cell 
wall, is consisted of a P-D-( 1,4) -linked xylopyranosyl backbone, 
which, depending on the origin, can be substituted with arabino- 
furanosyl, 4-0-methylglucopyranosyl, feruloyl and acetyl groups 
(Shibuya and Iwasaki, 1985). Feruloyl groups can form strong 
networks through peroxidase-catalyzed oxidative coupling form- 
ing diferuloyl bridges (Topakas et al., 2007). The main enzymes 
needed for depolymerization are xylanases, assisted by accessory 
enzymes such as P-xylosidases and different arabinofuranosidases 
making the xylan backbone more accessible (Sorensen et al., 
2007). Other accessible enzymes that enhance xylan degrada- 
tion are acetyl-xylanesterases (Poutanen et al., 1990), ferulic acid 
esterases (Topakas et al., 2007), and a-glucuronidases (De Vries 
et al., 1998). M. thermophila's hemicellulase genes are organized 
in 10 GH families (3, 10, 11, 30, 43, 51, 62, and 67) (Figure2) 
and nine carbohydrate esterase (CE) families (1, 3, 4, 5, 8, 9, 
12, 15, and 16) (Figure 3). Many of the encoding proteins have 
been isolated from the WT culture supernatant or expressed in 
heterologous hosts and finally characterized in terms of specific 
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1=1 other activity 
Oa-glucuronidase 
B P-l,4-xylanase 
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FIGURE 2 | Distribution of hemicellulolytic enzymes of M. thermophila throughout nine GH families. Other activities refer to (S-glycosidase (GH3), 
xylanase with endo-exo mode of action and xyiobiohyrolase (GH30), and galactan 1 ,3-beta-galactosidase (GH43). 
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FIGURE 3 | Distribution of hemicellulolytic enzymes of M. thermophila 
throughout nine CE families. Family CE4 is comprised of putative 
proteins with polysaccharide deacetylase activity, CE5 of cutinases and 



activity and physicochemical properties. The majority of them are 
predicted to follow the secretion pathway, while modified with N- 
and/or O- glucans, comprising a total amount of 66 enzymes that 
act synergistically for the degradation of hemicellulose. 

XYLANASES/XYLOSIDASES 

The degradation of xylan requires the concerted action of a 
number of powerful enzymes with varying specific activities, 
including xylanases and P-xylosidases. Xylanases (endo-l,4-|i- 
xylanases, EC 3.2.1.8) are enzymes hydrolyzing P-l,4-glycosidic 
linkages in the backbone of xylans, while most of them belong 
to GH family 10 or 11 based on amino acid similarities and 
structural features (Henrissat, 1991). GH10 xylanases exhibit 
less substrate specificity than GH11 enzymes and can hydrolyze 
different types of decorated xylans, while GH11 xylanases are 
highly specific and do not tolerate many decorations on the 
xylan backbone (Biely et al, 1997). P-Xylosidases (EC 3.2.1.37) 
hydrolyze the soluble xylo-oligosaccharides and xylobiose from 
the non-reducing end liberating xylose, produced by the activ- 
ity of xylanases. These enzymes play an important role in xylan 
degradation by relieving the end product inhibition of endoxy- 
lanases (Knob et al., 2010). The genome of M. thermophila 
encodes totally 12 xylanases with endo- mode of activity, clas- 
sified to GH 10 and 11 and four xylosidases, classified to GH3 
and 43 families (Table 4). All aforementioned xylan-degrading 
translated sequences, apart from three, are predicted to exhibit a 
potential secretion signal. Xylanases possess 1-3 N-glycosylation 
and several O-glycosylation sites, whereas more N-sites are pre- 
dicted for xylosidases, though not all of them are glycosylated 
during post-translational modifications. GH family 30 con- 
tains two genes encoding xylanolytic enzymes with endo-exo 
activity and one sequence for a characterized xylobiohydrolase, 
releasing xylobiose units from the substrate (Emalfarb et al., 
2012). 



D other activity 
B glucuronoyl esterase 
n acetyl esterase 
■ feruloyl esterase 




CE8, 12 of pectin esterases. ND (not determined) refers to sequences 
encoding putative proteins with unknown activity which are not classified 
to a specific family. 



Ten xylanases have been purified and characterized from 
multienzyme preparations of M. thermophila modified strains 
(Ustinov et al., 2008; van Gool et al., 2013). Four of them, belong- 
ing to GH10 family (Table 5), are the products of two genes, 
either with the presence of a family 1 CBM or displaying only 
the catalytic domain after partial proteolytic digestion (Ustinov 
et al., 2008). These enzymes, thought classified to the same fam- 
ily, can hydrolyze different types of decorated xylans. They differ 
in degradation of high and low substituted substrates and the 
substitution pattern seems to be an important factor influencing 
their efficiency (van Gool et al., 2012). Six xylanases, belonging to 
GH11 family, represent true xylanases, with high specific activ- 
ities against glucuronoxylans and arabinoxylans. Four of these 
enzymes exhibit lower thermostability in comparison to GH10 
xylanases, in which extended glycosylation has been noticed 
(Ustinov et al., 2008). One showed a substrate specificity pat- 
tern similar to GH10 enzymes and secreted in two forms, with 
or without CBM (van Gool et al, 2013). 

ARABIN0HYDR0LASES 

L-arabinose is widely present in various hemicellulosic biomass 
components, such as arabinoxylan, where the main P-D-(l,4)- 
linked xylopyranosyl backbone is substituted with arabinose 
residues. a-L-arabinofuranosidases (AFase; EC 3.2.1.55) are 
enzymes that release arabinofuranose residues substituted at posi- 
tion 0-2 or 0-3 of mono or di-substituted xylose residues 
(Gruppen et al., 1993). Apart from that, AFases act in syner- 
gism with other arabinohydrolases, endo-(l,5)-a-L-arabinanases 
(ABNase; EC 3.2.1.99) for the decomposition of arabinan, a 
major pectin polysaccharide. Arabinan consists of a backbone of 
a- (1,5) -linked L-arabinofuranosyl residues, some of which are 
substituted with a-(l,2)- or a-(l,3)-linked arabinofuranosides 
(Weinstein and Albersheim, 1979). Degradation of arabinan poly- 
mer to arabinose sugars is driven by the synergistic action of 
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Table 5 | Description of the characterized |i-1,4-xylanases isolated from the culture broth of a M. thermophila C1 mutant strain. 



Enzyme 


CAZy module 


MW-monomer (kDa) 


pHopt 


Topt PC) 


Pi 


Gene 


References 






Ayn iua 


un IU 


4z/o I 


k k in 


DO— / U 


"7 Q/Q O 

/.o/o.y 


VI lit 

xyl i 


Ustinov et al., 


zuuo, van uooi et ai. 


zU Iz 


XynlOB 


GH10 


57/46* 


5.5-7.0 


80-85 


4.4/4.3 


xyl3 


Ustinov et al., 


2008; van Gool et al.. 


2012 


XynlOC 


GH11 


40 


5.0 


80 


4.8 


xyl4 


Ustinov et al., 


2008 




Xyn11A 


GH11 


24 


6.5 


70 


7.9 


xyl2 


Ustinov et al., 


2008 




Xyn11B 


GH11 


23 


6.0-6.5 


65-70 


8.4 


xyl6 


Ustinov et al., 


2008 




Xyn11C 


GH11 


22 


4.5 


65 


6.7 


xyl5 


Ustinov et al., 


2008 




Xyl7 


GH11 


22/30* 


5.5-6.5 


50-60 


7.3/7.6 


xyl7 


van Gool et al 


, 2013 




Xyl8 


GH11 


22 


5.5-6.0 


50-65 


6.2 


xyl8 


van Gool et al 


, 2013 





They display optimal activity at temperatures between 50 and 70° C, increasing fungi's hydrolytic efficiency in various temperatures. Marked proteins (*) were 
isolated in two different forms, with (high molecular weight enzyme) or without CBM (low molecular weight enzyme). 



two major enzymes, AFases and ABNases (Kim, 2008). AFases 
specifically catalyze the hydrolysis of terminal non-reducing 
L-arabinofuranosyl residues from arabinan, while the result- 
ing debranched backbone could be efficiently hydrolyzed by 
endo-acting ABNases, thus generating a variety of arabino- 
oligosaccharides with an inverting mode of action (Beldman et al., 
1997). Thoughout CAZy families, arabinohydrolases belong to 
the GH family 43, 51, 54, 62, and 93 (Figure 3). 

The genome of the M. thermophila encodes 14 enzymes that 
putatively release arabinose or arabinose oligomers from arabinan 
(Hinz et al, 2009). Eleven sequences contain a secretion signal 
peptide and produced as extracellular or cell-bounded proteins, 
while almost all of them exhibit isoelectric point around 4.6- 
5.6 (Table 6). Seven of them have been selectively overexpressed 
homologously in M. thermophila CI host and found to release 
arabinose from wheat arabinoxylan polymers and oligomers 
(Hinz et al., 2009). M. thermophila arabinofuranosidases are 
selective in releasing arabinose from either single or double sub- 
stituted xylose residues in arabinoxylans. Eight enzymes, belong- 
ing to GH families 43, 51, 62, and 93 with different type of 
arabinolytic activity have been purified and characterized (Hinz 
et al, 2009; Kiihnel et al., 201 1; Pouvreau et al, 2011b) (Table 7). 

Abn7 and Abf3 are GH43 and GH51 arabinases respec- 
tively, which were selectively produced in CI host. Abn7 was 
found to hydrolyze arabinofuranosyl residues at position 0-3 
of double substituted xylosyl residues in arabinoxylan-derived 
oligosaccharides, while Abf3 released arabinose from position 
0-2 or 0-3 of single substituted xyloses. When these enzymes 
were incubated together, in combination with a GH10 endo- 
xylanase for the hydrolysis of arabinoxylans, they resulted in 
a synergistic increase in arabinose release from the substrate 
(Pouvreau et al., 2011b). In addition, a-L-arabinohydrolases 
Abnl, Abn2, and Abn4 were overexpressed in CI and the pro- 
duced culture supernatant has been shown to produce neu- 
tral branched arabino-oligosaccharides from sugar beet arabinan 
by enzymatic degradation. As found by sugar analysis, neutral 
arabino-oligosaccharides contained an ct-(l,5)-linked backbone 
of 1-arabinosyl residues and carried single substituted a- (1,3)- 
linked 1-arabinosyl residues or consisted of a double substi- 
tuted a- (1,2,3, 5) -linked arabinan structure within the molecule 
(Westphal et al, 2010). Enzyme Abn4 belongs to GH43 family 
and is more active toward branched polymeric arabinan substrate 



that releases arabinose monomers from single substituted ara- 
binose residues, while Abnl and Abn2 are active toward linear 
arabinan (Kiihnel et al., 2010). Abn2 is a member of GH93 family 
that consists of exoarabinases acting on linear arabinan, hydrolyz- 
ing the ct-l,5-linkages of arabinan polysaccharides presented as 
side chains of pectin. Their mode of action was studied with 
Abn2, which binds two arabinose units at the subsites — 1 and —2 
and releases arabinose. Three more arabinohydrolases were also 
overexpressed in CI strain (Hinz et al., 2009). Abn5 was found 
to be specifically active toward arabinan, but not arabinoxylan. 
Arabinofuranosidases Abfl and Abf2, members of GH62 family 
released 0-2 or 0-3 substituted arabinose or linked arabinofu- 
ranosyl from mono substituted xylose. GH family 62 arabino- 
furanosidases are reported to be predominantly active toward 
arabinoxylan and are, therefore, also called arabinoxylan arabino- 
furanohydrolases (Beldman et al., 1997). Several of these enzymes 
contain either a CBM1, like Abfl, or a CBM43 (xylan) -binding 
domain. 

ESTERASES 

The role of esterases in the breakdown of lignocellulosic mate- 
rial is complex and includes the cleavage of bonds between the 
main hemicellulose part and many types of side chains. So, upon 
a closer examination of the genome sequences of M. thermophila, 
there is a wide distribution of enzymatic activities through CE 
families. These enzymes are classified into nine families and their 
main activities, among others, include the hydrolysis of feruloyl 
and acetyl ester bonds. 

Feruloyl esterases (FAEs; EC 3.1.1.73) are enzymes responsi- 
ble for cleaving the ester-link between the polysaccharide main 
chain of xylans and monomeric or dimeric ferulates. They act 
synergistically with xylanases to release ferulic acid from cell- 
wall material and can be divided into four groups, namely 
A-D. The main difference between groups A and D is their 
substrate specificity toward synthetic substrates and their capa- 
bility of liberating diferuloyl bridges (Crepin et al., 2004). One 
of the first FAEs reported from thermophilic fungi, was pro- 
duced from M. thermophila under solid-state fermentation (SSF) 
conditions. The esterase activity was isolated and partially char- 
acterized for its ability to release ferulic acid from complex 
substrate, destarched wheat bran (Topakas et al., 2003). Two 
other FAEs, SfFaeB, a protein with molecular weight of 66 kDa 
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Table 7 | Description of the characterized endoarabinases and arabinofuranosidases isolated from the culture broth of a M. thermophila C1 
mutant strain. 



Enzyme CAZy Mode of action 

module 



Abf1 GH62 



MW- pH 0 pt T op t Uniprot/ Gene Source 

monomer (°C) SwissProt 

(kDa) ID 



Arabinofuranosidase/ 
releases 0-2 or 0-3 
arabinose from 
mono-substituted 
xylose 



abfl 



Selectively 
produced in 
C1-host 



References 



Hinz et al., 2009 



Abf2 GH62 



Arabinofuranosidase/ 
releases 0-2 or 0-3 
linked 

arabinofuranosyl 
residues from 
mono-substituted 
xylose 



abf2 



Selectively 
produced in 
C1-host 



Hinz etal., 2009 



Abf3 GH51 



Abn7 



GH43 



Arabinofuranosidase/ 70 
releases arabinose 
from the 

non-reducing end of 
reduced arabinose 
oligomers 

Arabinofuranosidase/ 70 
releases 0-3 linked 
arabinofuranosyl 
residues from 
di-substituted xylose 



5 40 HQ324254 



abf3 



40 



HQ324255 



abr>7 



Selectively 
produced in 
C1-host 



Selectively 
produced in 
C1-host 



Pouvreau et al., 2011b 



Pouvreau et al., 2011 b 



Abn1 



GH43 



Endoarabinase 



36 5.5 60 HQ324251 abnl Overexpressed Kuhnel et al., 2011 

in fermentation 
supernatant 



Abn2 GH93 Exoarabinase/ 40 

arabinobiose from 
the non-reducing end 
of reduced arabinose 
oligomers 

Abn4 GH43 Arabinofuranosidase/ 33 

releases arabinose 
from the 

non-reducing end of 
reduced arabinose 
oligomers 



50 



abn2 Overexpressed Kuhnel et al., 2011 
in fermentation 
supernatant 



5.5 



60 



HQ324253 



abr>4 Overexpressed 
in fermentation 
supernatant 



Kuhnel etal., 2011 



Abn5 



Arabinofuranosidase 



abn5 



Selectively 
produced in 
C1-host 



Hinz etal., 2009 



(homodimers of 33 kDa) (Topakas et al., 2004) and StFaeC, 
46 kDa (homodimers of 23 kDa) (Topakas et al., 2005), were 
purified to homogeneity from culture supernatants of M. ther- 
mophila. SfFaeB hydrolyzed methyl p-coumarate, methyl caffeate 
and methyl ferulate and was active on substrates containing 
ferulic acid ester linked to the C-5 and C-2 linkages of ara- 
binofuranose. StFaeC showed maximum catalytic efficiency on 
4-hydroxy-3-methoxy cinnamate, a substrate with both hydroxyl 



and methoxy substituents, indicating that it may be the most 
promising type of FAE as a biocatalyst for the enzymatic feruloy- 
lation of aliphatic alcohols, oligo- and polysaccharides. Properties 
of characterized FAEs are summarized in Table 8. Among the 
sequences registered to Genome Portal, there are four sequences 
encoding proteins with catalytic activity of FAE, all belonging 
to CE family 1. Two of them (MYCTH_48379, MYCTH_39279) 
seem to be identical with characterized FAEs secreted from M. 
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Table 8 | Description of M. thermophila characterized esterases (FAEs, AcEs, and GEs). 



Enzyme Type MW-monomer (kDa) pH op t T op t (°C) pi Gene Source References 



FAEs 



StFaeB 


B 


33* 


6 


55-60 


3.5 




Isolated from culture supernatant 


Topakas et al., 2004 


StFaeC 


C 


23* 


6 


55 


<3.5 




Isolated from culture supernatant 


Topakas etal., 2005 


FAE 


ND 


27 


8 


60 


5 




Isolated from culture supernatant 


Topakas et al., 2003 


FaeA1 


A 


29 


6.5 


45 


w5.5 




Overexpressed in fermentation supernatant 


Kuhnel etal., 2012 


FaeA2 


A 


36 


7.5 


40 


s»5.2 




Overexpressed in fermentation supernatant 


Kuhnel etal., 2012 


FaeB2 


B 


33 


7.5 


45 


**6.0 




Overexpressed in fermentation supernatant 


Kuhnel etal., 2012 


MtFaela 


B 


39 


7 


50 


ND 


faela 


Expressed in P. pastoris 


Topakas et al., 2012 




MtAxe3 


1 


33.6 


7 


40 


ND 


axe3 


Overexpressed in fermentation supernatant 


Pouvreau et al., 2011 


MtAxe2 


5 


23.6 


7 


40 


ND 


axe2 


Overexpressed in fermentation supernatant 


Pouvreau et al., 2011 


GEs 


















StGE2 


15 


43 


7 


55 


ND 


ge2 


Expressed in R pastoris 


Topakas etal., 2010 


StGE1 


15 


58 


6 


60 


ND 




Isolated from culture supernatant 


Vafiadi et al., 2009 



*AII proteins are monomeric, while in case of FaeB2, dimeric structures are detected, after comparing the results of SDS-PAGE and native electrophoresis. 



thermophila CI strain (Kuhnel et al., 2012). One sequence (JDI 
ID: 96478) has been heterologously expressed in P. pastoris and 
encoded a 39 kDa protein (faelA; MtFaela), which showed high 
activity toward methyl caffeate and p-coumarate and a strong 
preference for the hydrolysis of n-butyl and iso-butyl ferulate 
(Topakas et al., 2012). In addition, MtFael esterase release fer- 
ulic acid from destarched wheat bran only by the synergistic 
action of an endo-xylanase (a maximum of 41% total ferulic 
acid released after 1 h incubation). MYCTH_2302953 sequence 
has not yet been characterized, however it still shows 66% iden- 
tity with a type B FAE from Neurospora crassa (CAC05587.1). All 
proteins encoded by the above sequences appear to be secreted 
and bring several N- and O-glycosylation sites, as shown in 
Table 9. 

About 60-70% of the xylose residues in hardwood xylan 
are acetylated at the C2 and/or C3 positions (Lindberg et al, 
1973). The complete degradation of acetylated xylans by microbes 
requires the action of acetyl esterases (AcEs; EC 3.1.1.72), which 
cleave acetyl side groups from the heteroxylan backbone, and 
act in synergy with other hemicellulases (Tenkanen et al, 1996). 
Eight sequences that encode proteins with AcE activity were 
detected in the genome of M. thermophila and showed identity 
with characterized enzymes. All of them are secreted, as predicted 
with SignalP and belong to CE families 1, 3, 5, 16 (Table 9). 
Two of them, Axe2 and Axe3, which bare members of CE5 
and CE1 families, respectively, were isolated and characterized 
(Pouvreau et al, 2011a). Annotated genes, encoding the putative 
enzymes were cloned into the specially designed M. thermophila 
Cl-expression host (Verdoes et al, 2010) and over-produced in 
the culture medium. Axe2 and Axe3 are able to hydrolyze acetyl 
groups when they are substituted to the 0-2 and 0-3 positions 
of acetylated xylo-oligosaccharides and complex insoluble poly- 
meric substrates and had a preference for xylooligosaccharides 
(Pouvreau et al., 201 la). 

Glucuronoyl esterases (GEs) are recently discovered enzymes 
that are suggested to play an important role in the dissocia- 
tion of lignin from hemicellulose and cellulose by cleaving the 
ester bonds between the aromatic alcohols of lignin and the 



carboxyl groups of 4-O-methyl-D-glucuronic acid residues in 
glucuronoxylan (Spanikova and Biely, 2006). Sequence alignment 
studies of these enzymes have revealed a novel conserved amino 
acid sequence G-C-S-R-X-G that features the characteristic ser- 
ine residue involved in the mechanism of this esterase family. 
It has been shown that the mode of action probably involves a 
nucleophilic serine (Topakas et al, 2010). The genome of M. ther- 
mophila possesses two genes classified to family CE15 that encode 
proteins with activity of 4-O-methyl-glucuronoyl esterase. Both 
putative enzymes are secreted and have potential glycosylation 
sites. The first GE (SfEGl), isolated from the culture filtrate of 
M. thermophila, was proved to be a thermophilic enzyme that 
presents a C-terminal CBM, which was active on substrates con- 
taining glucuronic acid methyl ester (Vafiadi et al, 2009). Another 
CE15 protein molecule, SfGE2 was heterologously expressed in 
yeast P. pastoris and was used to prove that nucleophilic serine 
residue is responsible for catalytic action of GEs, through site- 
directed mutagenesis studies (Topakas et al., 2010) and crystal 
structure determination (Charavgi et al, 2013). 

MANNAN-DEGRADING ENZYMES 

Mannan is a great component of hemicellulose, therefore, as 
expected, the lignocellulolytic toolbox of M. thermophila pos- 
sesses a complete reservoir of genes encoding mannan degrading 
enzymes. Mannan polymer primarily consists of a backbone 
structure composed of P-l,4-bound mannose residues or combi- 
nation of glucose and mannose residues and can be hydrolyzed to 
its monomers with the synergistic action of P-mannanases (EC 
3.2.1.78), P-mannosidases (EC 3.2.1.25), ct-galactosidases (EC 
3.2.1.22), and acetylmannan esterases (E.C. 3.1.1.6) (McCleary, 
1988). The genome of M. thermophila encodes three enzymes that 
putatively catalyze random cleavage of the mannan polysaccha- 
ride and belong to GH family 5 and 26. One of these enzymes has 
been isolated from culture supernatant, characterized and classi- 
fied as GH5 endo-f5-l,4-mannosidase (bMan2, Dotsenko et al., 
2012). In addition, there are two genes encoding putative P- 
mannosidases belonging to GH2 family, while one of them has 
been characterized in terms of its specificity and physicochemical 
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properties (bMann9, Dotsenko et al., 2012). Two GH27 and 
one GH26 a-galactosidases boost the efficiency of fungal culture 
supernatant against hydrolysis of mannan substrate (Emalfarb 
et al., 2012), while two CE12 family genes encoding proteins 
with high similarity to known acetyl-mannan esterases have been 
found. 

AUXILIARY ENZYMES 

In spite of the cooperative activity exhibited by the cellulolytic 
and hemicellulolytic enzymes, the impressive hydrolytic ability 
of various microorganisms in nature cannot be attributed only 
to this endo-exo mechanism. Apart from the hydrolytic sys- 
tem responsible for carbohydrate degradation, it seems that an 
oxidative system catalyze lignin depolymerization and oxidation 
of plant cell wall components, yielding reactive molecules (e.g., 
H2O2). Recent evidence highlights the critical role of alternative 
enzymatic partners involved in the oxidation of cell wall compo- 
nents. Among these enzymes, outstanding role during hydrolysis 
exhibit the originally described as cellulases LPMO enzymes, 
CDH and multicopper enzymes such as laccases. The genome of 
M. thermophila possesses more than 30 genes that encode pro- 
teins with such auxiliary activities (Figure 4). Members of the 
LPMO family AA9, have been shown to be copper-dependent 
monooxygenases that enhance cellulose degradation in concert 
with classical cellulases, as aforementioned before and reviewed 
by Dimarogona et al. (2013). These enzymes catalyze the cleavage 
of cellulose by an oxidative mechanism provided that reduc- 
tion equivalents are available. These equivalents either involve 
low molecular weight reducing agents (e.g., ascorbate) or are 
produced by CDH activity (Langston et al., 2011). CDHs are 
extracellular enzymes produced by various wood-degrading fungi 
that oxidize soluble cellodextrins, mannodextrins and lactose 
efficiently to their corresponding lactones by a ping-pong mech- 
anism using a wide spectrum of electron acceptors (Henriksson 
et al., 2000). Throughout the genome of M. thermophila, two 
genes encoding proteins classified to AA3 and 8 families have been 
identified (Figure 4). Both of them are predicted to be secreted 
in the culture supernatant and have potential glycosylation sites. 
The translated CDH MYCTH_1 1 1388 exhibits a C-terminal CBM 
and a cDNA clone of this sequence has been isolated and bio- 
chemically characterized by screening an expression library of 
M. thermophila (Subramaniam et al, 1999). Canevascini et al. 
(1991) purified a monomeric (91 kDa) and a dimeric (192 kDa) 
form of CDH that differed not only in molecular weight, 
but amino acid composition and carbohydrate content. Both 
forms oxidized cellobiose in the presence of cytochrome c or 
dichlorophenol-indophenol. 

Laccases (EC 1.10.3.1) are multicopper enzymes that catalyze 
the oxidation of a variety of phenolic compounds, with con- 
comitant reduction of O2 to H2O. These polyphenol oxidases are 
produced by most ligninolytic basidiomycetes (Baldrian, 2006) 
and can degrade lignin and other recalcitrant compounds in 
the presence of redox mediators (Ruiz-Duenas and Martinez, 
2009). The genome of the M. thermophila encodes eight puta- 
tive enzymes with multicopper oxidase activity. Four of them have 
been annotated and one (MYCTH_51627) matches the led gene 
product encoding an extracellular laccase (Berka et al., 1997). 



Four sequences are predicted to possess a secretion signal, while 
one appears to remain membrane-bound. Lccl gene has been iso- 
lated from fungi's genome, heterologously expressed in A. oryzae 
and the produced 85 kDa enzyme (MtL) was characterized as a 
thermostable low oxidation potential laccase with high reactivity 
in aqueous medium at room temperature and neutral pH. MtL 
was tested for its capacity to catalyze enzymatic oxidation of sev- 
eral phenolic and polyphenolic compounds (ferulic acid, gallic 
acid, caffeic acid, and catechin) (Mustafa et al., 2005). M. ther- 
mophila laccases have been reported to oxidize lignin surface, by 
increasing the amount of radicals during thermomechanical pulp 
fiber material bleaching (Gronqvist et al., 2003) and promote 
oxidative polymerization of Kraft lignin from back liquor, which 
is the main by-product of pulp and paper industry (Gouveia et al., 
2013). 

LIGN0CELLUL0SIC POTENTIAL— STATISTICS 

M. thermophila is a powerful lignocellulolytic organism, which 
secretes a complex system of carbohydrate hydrolases for the 
breakdown of cellulose and hemicellulose, as well as oxidore- 
ductases embedded in lignin degradation. Genome analysis in 
this review revealed 30 genes encoding cellulases classified to 10 
GH families, 66 genes encoding hemicellulases classified to 10 
GHs, 9 CEs and 35 genes encoding auxiliary enzymes. The lat- 
ter include CDHs (AA3/AA8 family), LPMOs (AA9 family) and 
multicopper oxidases (laccases or laccase-like enzymes). Out of 
the total consortium of M. thermophila sequences encoding pro- 
teins with putative lignocellulosic activity, 80.2% are predicted 
to have a secretion signal peptide. Almost 76% of cellulases, 
hemicellulases and 88% of the accessory redox enzymes are tar- 
geted to secretion pathway, while only a very small amount 
remain inside the cell or represent membrane cell — bound 
macromolecules. Only 15.8% of the secreted enzymes in this 
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FIGURE 5 | Theoretical molecular weight of secreted enzymes of M. 
thermophila classified in several GHs and CEs families, plotted against 
theoretical pi. The average molecular weight was calculated at 51.05 ± 



16.2 kDa (range between 21 and 97 kDa) for cellulolytic enzymes, 35.5 ± 
19.5 kDa (range between 22 and 89 kDa) for hemicellulases (GHs/CEs), and 
28.51 ± 4.1 kDa (range between 23 and 39 kDa) for the fraction of esterases. 



review are predicted to possess a CBM and the majority of them 
comprise of auxiliary enzyme activities. The theoretical average 
molecular weight of secreted enzymes is 41.36 ± 15.9, vary- 
ing between 10 and 97 kDa. The majority of secreted enzymes 
have molecular weight varying between 20 and 50 kDa, whereas 
fi-xylosidases and f5-glycosidases (GH3 family), and arabinofura- 
nosidases (GH43 and GH51) appear to be high molecular weight 
proteins (Figure 5). The theoretical average isoelectric point of 
secretory enzymes is calculated 5.27 ± 0.8, at a range 4.34-7.9. In 
vivo expression and study of these enzymes would give different 
results, as the proteins are glycosylated, so size and pi value tend to 
moderate. 

PROTEIN GLYC0SYLATI0N 

A total proportion of 92.8% of secreted proteins have either N- 
or O- putative glycosylation sites. These proteins are often glyco- 
sylated due to the existence of many Asn-Xaa-Ser/Thr sequons, 
which are known to be a prerequisite for N-glycosylation post- 
translational modifications. The molecules of many GHs and 
accessory enzymes have a modular structure consisting of a cat- 
alytic module, flexible peptide linker, and CBM. Flexible linker 
peptides, which are rich in Ser and Thr residues, are typically 
O-glycosylated (Gilkes et al., 1991). The N-glycosylation seems 
to be restricted to the catalytic modules, and it is usually absent 
in other parts of enzyme molecules. Various N-linked glycan 
structures have been found in different enzymes from M. ther- 
mophila, belonging to different enzyme classes and protein fami- 
lies (Gusakov et al., 2008). It has been noticed that glycosylation 
follows a heterogeneity pattern, meaning that in some molecules, 
the same Asn residue was modified with oligosaccharides hav- 
ing different structure, while not all of the potential glycosylation 
sites were found to be occupied. The most frequently met N- 
linked glycan was (Man)3(GlcNAc)2, a pentasaccharide which 



represents a well-known conserved core structure that forms 
mammalian-type high-mannose and hybrid/complex glycans in 
glycoproteins from different organisms (Dwek et al., 1993). Both 
types of glycosylation occur in 65% of secreted cellulases, 62.1% 
of secreted hemicellulases, while only O-glycosylation patterns 
appear in most of accessory enzymes. The presence of N-linked 
glycans is common for catalytic domain of the enzymes, while 
O-glycosylation usually occurs in linker region. Even though pre- 
dicted to, non-secreted enzymes are not modified in vivo with gly- 
cans, since this procedure has been noticed as a post-translational 
modification in proteins targeted to the secretory pathway of the 
cell (Blom et al, 2004). 

CONCLUSIONS 

Rapid depolymerization of lignocellulosic material is a distin- 
guishing feature of thermophilic fungi, such as M. thermophila, 
which was isolated from soil and self-heating masses of com- 
posted vegetable matter (Domsch et al., 1993). However, the 
precise biochemical mechanisms and underlying genetics of this 
procedure are not completely understood. Systematic examina- 
tion of the M. thermophila genome revealed a unique enzy- 
matic system comprising of an unusual repertoire of auxiliary 
enzymes, especially those classified to AA9 family, and provided 
insights into its extraordinary capacity for protein secretion. 
The current review constitutes, to the best of our knowledge, 
the first genomic analysis of the lignocellulolytic system of M. 
thermophila. The genomic data, along with the observed enzy- 
matic activity of several isolated and characterized enzymes 
suggest that this fungus possesses a complete set of enzymes, 
including 30 cellulases, 66 hemicellulases, and 35 proteins with 
auxiliary auxiliary enzymes, covering the most of the recog- 
nized CAZy families. From its cellulases to its oxido-reductases 
and multicopper enzymes, M. thermophila gene complement 
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represents several avenues for further research and its diverse 
array of enzymatic capabilities will contribute to the study of 
lignocellulose degradation and the subsequent ethanol biofuel 
production. 
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